Inside-Outside Estimation Meets Dynamic EM
نویسنده
چکیده
We briefly review the inside-outside and EM algorithm for probabilistic context-free grammars. As a result, we formally prove that inside-outside estimation is a dynamic-programming variant of EM. This is interesting in its own right, but even more when considered in a theoretical context since the wellknown convergence behavior of inside-outside estimation has been confirmed by many experiments but apparently has never been formally proved. However, being a version of EM, inside-outside estimation also inherits the good convergence behavior of EM. Therefore, the as yet imperfect line of argumentation can be transformed into a coherent proof. 1 Inside-Outside Estimation The modern inside-outside algorithm was introduced by [4] who reviewed an algorithm proposed by [1] and extended it to an iterative training method for probabilistic context-free grammars enabling the use of unrestricted free text. In the following, y1 . . . yN are numbered (but unannotated) sentences. Definition: Inside-outside re-estimation formulas for probabilistic context-free grammars in Chomsky normal form are given by (see [4], but see also [1] for the special case N = 1): p̂(A → a) := ∑yN w=y1 Cw(A → a) ∑yN w=y1 Cw(A) , and p̂(A → BC) := ∑yN w=y1 Cw(A → BC) ∑yN w=y1 Cw(A) . The key variables of this definition are so-called category and rule counts: Cw(A) := 1 P ∑n s=1 ∑n t=s e(s, t, A) · f(s, t, A), Cw(A → a) := 1 P ∑ 1≤t≤n, wt=a e(t, t, A) · f(t, t, A), and Cw(A → BC) := 1 P ∑n−1 s=1 ∑n t=s+1 ∑t−1 r=s p(A → BC)e(s, r, B)e(r + 1, t, C)f(s, t, A) which are computed for each sentence w := w1 . . . wn with so-called inside and outside probabilities: An inside probability is defined as the probability of category A generating observations ws . . . wt, i.e. e(s, t, A) := p(A ⇒ ∗ ws . . . wt). In determining a recursive procedure for calculating e, two cases must be considered: • (s = t): Only one observation is emitted and therefore a rule of the form A → ws applies: e(s, s, A) = p(A → ws), if (A → ws) ∈ G (and 0, otherwise). • (s < t): In this case we know that rules of the form A → BC must apply since more than one observation is involved. Thus, e(s, t, A) can be expressed as follows: e(s, t, A) =
منابع مشابه
The Inside-Outside Algorithm
This note describes the inside-outside algorithm. The inside-outside algorithm has very important applications to statistical models based on context-free grammars. In particular, it is used in EM estimation of probabilistic context-free grammars, and it is used in estimation of discriminative models for context-free parsing. As we will see, the inside-outside algorithm has many similarities to...
متن کاملInfluence of Formulation Parameters on the Release of Diclofenace Sodium from Matrices with Manufacturing Formulation Ingredients
Effects of formulation parameters on the fractional release profile of diclofenac sodium from matrices having the manufacturing formulation ingredients are studied. As a content of cetyl alcohol (rate controlling agent) in the matrix increases, the fractional release decreases. The fractional release increases either by increasing sucrose content outside the granule or by decreasing sucrose...
متن کاملPatch Enclosure and Localized Effects of Selected Acacia Species on Herbaceous Richness and Soil Properties of Rangelands in Somali Regional State in Ethiopia
Enclosure and Acacia shade availability to plants are basic variable in arid and semi-arid rangelands. the aim of this study was investigation the impact of patch enclosure and Acacia shade using four treatments which are Inside Enclosure Under Acacia shade (IEUA), Inside Enclosure Without Acacia shade (IEWA), Outside Enclosure Under Acacia shade (OEUA) and Outside Enclosure W...
متن کاملStochastic Analysis of Lexical and Semantic Enhanced Structural Language Model
In this paper, we present a directed Markov random field model that integrates trigram models, structural language models (SLM) and probabilistic latent semantic analysis (PLSA) for the purpose of statistical language modeling. The SLM is essentially a generalization of shift-reduce probabilistic push-down automata thus more complex and powerful than probabilistic context free grammars (PCFGs)....
متن کاملIn-Between Space, Dialectic of Inside and Outside in Architecture
Defining space by dividing it to inside and outside is one of human’s ways to recognize his positionin environment. Architecture is created to response to this need for inside/outside spaces. Design of inside and outsideSpaces and relation between them always has been one of necessities for definition and limitation of human livingspaces, but little attention to relation of this two spatial rea...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/cs/0412016 شماره
صفحات -
تاریخ انتشار 2001